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Abstract 

In the capacitated survivable network design problem (Cap-SNDP), we are given an undirected 
multi-graph where each edge has a capacity and a cost. The goal is to find a minimum cost subset 
of edges that satisfies a given set of pairwise minimum-cut requirements. Unlike its classical special 
case of SNDP when all capacities are unit, the approximability of Cap-SNDP is not well understood; 
even in very restricted settings no known algorithm achieves a o(m) approximation, where m is the 
number of edges in the graph. In this paper, we obtain several new results and insights into the 
approximability of Cap-SNDP. 

We give an O(logn) approximation for a special case of Cap-SNDP where the global minimum 
cut is required to be at least R. (Note that this problem generalizes the min-cost A-edge-connected 
subgraph problem, which is the special case of our problem when all capacities are unit.) Our 
result is based on a rounding of a natural cut-based LP relaxation strengthened with knapsack- 
cover (KC) inequalities. Our technique extends to give a similar approximation for a new network 
design problem that captures global minimum cut as a special case. We then show that as we move 
away from global connectivity, even for the single pair case (that is, when only one pair (s, t) has 
positive connectivity requirement), this strengthened LP has integrality gap. Furthermore, in 
directed graphs, we show that single pair Cap-SNDP is 2 log "-hard to approximate for any fixed 
constant 5 > 0. 

We also consider a variant of the Cap-SNDP in which multiple copies of an edge can be bought: 
we give an O(logfc) approximation for this case, where k is the number of vertex pairs with non- 
zero connectivity requirement. This improves upon the previously known 0(min{fc, log -R max })- 
approximation for this problem when the largest minimum-cut requirement, namely i? max , is large. 
On the other hand, we observe that the multiple copy version of Cap-SNDP is f2(loglogn)-hard to 
approximate even for the single-source version of the problem. 
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1 Introduction 



In this paper we consider the capacitated survivable network design problem (Cap-SNDP). The input 
consists of an undirected n- vertex multi-graph G(V, E) and an integer requirement R^ for each un- 
ordered pair of nodes (i, j). Each edge e of G has a cost c(e) and an integer capacity u(e). The goal 
is to find a minimum-cost subgraph H of G such that for each pair of nodes i,j the capacity of the 
minimum-cut between i and j in H is at least Rij. This generalizes the well-known survivable network 
design problem (SNDP) problem in which all edge capacities are 1. SNDP already captures as special 
cases a variety of fundamental connectivity problems in combinatorial optimization such as the min- 
cost spanning tree, min-cost Steiner tree and forest, as well as min-cost A-edge-connected subgraph; 
each of these problems has been extensively studied on its own and several of these special cases are 
NP-hard and APX-hard to approximate. Jain, in an influential paper |15| . obtained a 2-approximation 
for SNDP via the standard cut-based LP relaxation using the iterated rounding technique. 

Although the above mentioned 2-approximation for SNDP has been known since 1998, the approx- 
imability of Cap-SNDP has essentially been wide open even in very restricted special cases. Similar 
to SNDP, Cap-SNDP is motivated by both practial and theoretical considerations. These problems 
find applications in the design of resilient networks such as in telecommunication infrastructure. In 
such networks it is often quite common to have equipment with different discrete capacities; this 
leads naturally to design problems such as Cap-SNDP. At the outset, we mention that a different and 
somewhat related problem is also referred to by the same name, especially in the operations research 
literature. In this version the subgraph H has to support simultaneously a flow of Rij between each 
pair of nodes (i,j); this is more closely related to multicommodity flows and buy-at-bulk network 
design. Our version is more related to connectivity problems such as SNDP. 

As far as we are aware, the version of Cap-SNDP that we study was introduced (in the approxi- 
mation algorithms literature) by Goemans et al. [M] in conjunction with their work on SNDP. They 
made several observations on Cap-SNDP: (i) Cap-SNDP reduces to SNDP if all capacities are the 
same, (ii) there is an 0(min(m, i? max )) approximation where m is the number of edges in G and 
-Rmax = maxjj Rij is the maximum requirement, and (iii) if multiple copies of an edge are allowed then 
there is an 0(log i? max )-approximation. We note that in the capacitated case i? max can be exponen- 
tially large in n, the number of nodes of the graph. Carr et al. [6] observed that the natural cut-based 
LP relaxation has an unbounded integrality gap even for the graph consisting of only two nodes s,t 
connected by parallel edges with different capacities. Motivated by this observation and the goal 
of obtaining improved approximation ratios for Cap-SNDP, |6J strengthened the basic cut-based LP 
by using knapsack- cover inequalities. (Several subsequent papers in approximation algorithms have 
fruitfully used these inequalities.) Using these inequalities, [6] obtained a f3(G) + 1 approximation for 
Cap-SNDP where (3(G) is the maximum cardinality of a bond in the underlying simple graph: a bond 
is a minimal set of edges that separates some pair of vertices with positive demand. Although /3(G) 
could be @(n 2 ) in general, for certain topologies — for instance, if the underlying graph is a line or a 
cycle — this gives constant factor approximations. 

The above results naturally lead to several questions. What is the approximability of Cap-SNDP? 
Should we expect a poly-logarithmic approximation or even a constant factor approximation? If not, 
what are interesting and useful special cases to consider? And do the knapsack cover inequalities help 
in the general case? What is the approximability of Cap-SNDP if one allows multiple copies? Does 
this relaxed version of the problem allow a constant factor approximation? 

In this paper we obtain several new positive and negative results for Cap-SNDP that provide new 
insights into the questions above. 
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1.1 Our Results 



We first discuss results for Cap-SNDP where multiple copies are not allowed. We initiate our study 
by considering the global connectivity version of Cap-SNDP where we want a min-cost subgraph with 
global min-cut at least R; in other words, there is a "uniform" requirement Rij = R for all pairs 
We refer to this as the Cap-R- Connected Subgraph problem; the special case when all capacities are 
unit corresponds to the classical minimum cost A-edge-connected (spanning) subgraph problem, which 
is known to be APX-hard |12j . We show the following positive result for arbitrary capacities. 

Theorem 1.1. There is a randomized O (log n)- approximation algorithm for the Cap-R-Connected 
Subgraph problem. Moreover, for any 7 > 1, there is a randomized 0(7 log n)- approximation algorithm 
with running time for "nearly uniform" Cap-SNDP when all pairwise requirements are in [R, jR]. 

To prove Theorem ll.il we begin with a natural LP relaxation for the problem. Almost all positive 
results previously obtained for the unit capacity case are based on this relaxation. As remarked already, 
this LP has an unbounded integrality gap even for a graph with two nodes (and hence for Cap-i?- 
Connected Subgraph). We strengthen the relaxation by adding the valid knapsack cover inequalities. 
Although we do not know of a polynomial time algorithm to separate over these inequalities, following 
[6] , we find a violated inequality only if the current fractional solution does not satisfy certain useful 
properties. Our main technical tool both for finding a violated inequality and subsequently rounding 
the fractional solution is Karger's theorem on the number of small cuts in undirected graphs [16j. 

We believe the approach outlined above may be useful in other network design applications. As a 
concrete illustration, we use it to solve an interesting and natural generalization of Cap-i?-Connected 
Subgraph, namely, the k-Way-TZ- Connected Subgraph problem. The input consists of (k — 1) integer 
requirements R\, . . . Rk-i, such that R% < R% < . . . < Rk-i- The goal is to find a minimum-cost 
subgraph H of G such that for each 1 < i < k — 1, the capacity of any (i + l)-way cut of G is at least 
rS it is easy to see that Cap-i?-Connected Subgraph is precisely the £;-Way-7?.-Connected Subgraph, 
with k = 2. Note that the fc-Way-7?.-Connected Subgraph problem is not a special case of the general 
Cap-SNDP as the cut requirements for the former problem are not expressible as pairwise connectivity 
constraints. Interestingly, our techniques for Cap-i?-Connected Subgraph can be naturally extended 
to handle the multiway cut requirements, yielding the following generalization of Theorem II .11 

Theorem 1.2. There is a randomized 0(k log n)- approximation algorithm for the k-Way-TZ- Connected 
Subgraph problem with running time. 

We remark that even for the unit-capacity case of this problem, it is not clear how to obtain a 
better ratio than that guaranteed by the above theorem. We discuss more in Section 12.31 

Once the pairwise connectivity requirements are allowed to vary arbitrarily, the Cap-SNDP problem 
seems to become distinctly harder. Surprisingly, the difficulty of the general case starts to manifest even 
for the simplest representative problem in this setting, where there is only one pair (s, t) with R s t > 0; 
we refer to this as the single pair problem. The only known positive result for this seemingly restricted 
case is a polynomial-factor approximation that follows from the results in [141(6] for general Cap-SNDP. 
We give several negative results to suggest that this special case may capture the essential difficulty 
of Cap-SNDP. In particular, we start by observing that the LP with knapsack cover inequalities has 
an Sl(n) integrality gap even for the single-pair problemU Next we show that the single pair problem 

An i-way cut C of a graph G(V, E) is a partition of its vertices into i non-empty sets Vi, . . . , V*; we use 8(C) to denote 
the set of edges with endpoints in different sets of the partition C. The capacity of an i-way cut C is the total capacity 
of edges in 5(C). 

2 In [6] it is mentioned that there is a series-parallel graph instance of Cap-SNDP such that the LP with knapsack-cover 
inequalities has an integrality gap of at least [/3(G) /2J +1. However, no example is given; it is not clear if the gap applied 
to a single pair instance or if /3(G) could be as large as n in the construction. 
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is ft (log log ra)-hard to approximate. 

Theorem 1.3. The single pair Cap-SNDP problem cannot be approximated to a factor better than 
ft (log log ra) unless NP C DTIME(n lo ^ lo ^ n ). 

The above theorem is a corollary of the results in Chuzhoy et al. 's work on the hardness of related 
network design problems [9j. We state it as a theorem to highlight the status of the problem. (See 
Appendix|A]for a brief proof sketch.) We further discuss this connection at the end of this section. We 
prove a much stronger negative result for the single pair problem in directed graphs. Since in the unit- 
capacity case, polynomial-time minimum-cost flow algorithms solve the single-pair problem exactly 
even in directed graphs, the hardness result below shows a stark contrast between the unit-capacity 
and the non-unit capacity cases. 

Theorem 1.4. In directed graphs, the single pair Cap-SNDP cannot be approximated to a factor 
better than 2 lo e (1 ~* )n for any < 5 < 1, unless NP C DTIME(n polylo ^ n ">). Moreover, this hardness 
holds for instances in which there are only two distinct edge capacities. 

Allowing Multiple Copies: Given the negative results above for even the special case of the single- 
pair Cap-SNDP, it is natural to consider the relaxed version of the problem where multiple copies of 
an edge can be chosen. Specifically, for any integer a > 0, a copies of e can be bought at a cost of 
a ■ c(e) to obtain a capacity a ■ u(e). In some applications, such as in telecommunication networks, 
this is a reasonable model. As we discussed, this model was considered by Goemans et al. [14J who 
gave an 0(log-R max ) approximation for Cap-SNDP. This follows from a simple O(l) approximation 
for the case when all requirements are in {0, R}. The advantage of allowing multiple copies is that one 
can group request pairs into classes and separately solve the problem for each class while losing only 
the number of classes in the approximation ratio. For instance, one easily obtains a 2-approximation 
for the single pair problem even in directed graphs, in contrast to the difficulty of the problem when 
multiple copies are not allowed. Note that this also implies an easy 2k approximation where k is the 
number of pairs with Rij > 0. We address the approximability of Cap-SNDP with multiple copies 
of edges allowed. When -R max is large, we improve the min{2fe, 0(log i? max )}-approximation discussed 
above via the following. 

Theorem 1.5. In undirected graphs, there is an O (log k)- approximation algorithm for Cap-SNDP 
with multiple copies, where k is the number of pairs with Rij > 0. 

Both our algorithm and analysis are inspired by the 0(log /^-competitive online algorithm for the 
Steiner forest problem by Berman and Coulston [5], and the subsequent adaptation of these ideas for 
the priority Steiner forest problem by Charikar et al. [7]. However, we believe the analysis of our 
algorithm is more transparent (although it gets weaker constants) than the original analysis of [5]. 

We complement our algorithmic result by showing that the multiple copy version is ft (log log n)- 
hard to approximate. This hardness holds even for the single-source Cap-SNDP where we are given 
a source node s G V, and a set of terminals T C V, such that Rij > iff i = s and j G T. Observe 
that single-source Cap-SNDP is a simultaneous generalization of the classical Steiner tree problem 
(Rij € {0, 1}) as well as both Cap-i?-Connected Subgraph and single-pair Cap-SNDP. 

Theorem 1.6. In undirected graphs, single source Cap-SNDP with multiple copies cannot be approx- 
imated to a factor better than ft (log log n) unless NP C DT/A^(n logloglog,! ). 

The above theorem, like Theorem 11.31 also follows easily from the results of [9]. For completeness, 
we provide a proof of Theorem II. 61 in Appendix [A] We note that the hardness reduction above creates 
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instances with super-polynomially large capacities. For such instances, our 0(log ^-approximation 
strongly improves on the previously known approximation guarantees. 

Related Work: Network design has a large literature in a variety of areas including computer science 
and operations research. Practical and theoretical considerations have resulted in numerous models 
and results. Due to space considerations it is infeasible even to give a good overview of closely related 
work. We briefly mention some work that allows the reader to compare the model we consider here 
to related models. As we mentioned earlier, our version of Cap-SNDP is a direct generalization of 
SNDP and hence is concerned with (capacitated) connectivity between request node pairs. We refer 
the reader to the survey [T7] and some recent and previous papers [TH [151 El EH ttU IH] for pointers 
to literature on network design for connectivity. A different model arises if one wishes to find a 
min-cost subgraph that supports multicommodity flow for the request pairs; in this model each node 
pair needs to routes a flow of Rij in the chosen graph and these flows simultaneously share the 
capacity of the graph. We observe that if multiple copies of an edge are allowed then this problem is 
essentially equivalent to the non-uniform buy-at-bulk network design problem. Buy-at-bulk problems 
have received substantial attention; we refer the reader to [8j for several pointers to this work. If 
multiple copies are not allowed, the approximability of this flow version is not well-understood; for 
example if the flow for each pair is only allowed to be routed on a single path, then even checking 
feasibility of a given subgraph is NP-Hard since the problem captures the well-known edge-disjoint 
paths and unsplittable flow problems. Andrews and Zhang [2] have recently considered special cases 
of this problem with uniform capacities while allowing some congestion (that is, a few copies) on the 
chosen edges. 

The /c-Way-7£-Connected Subgraph problem that we consider does not appear to have been con- 
sidered previously even in the unit-capacity case. 

2 The Cap-i?-Connected Subgraph problem 

In this section, we prove Theorem II . lj, giving an 0(log n)-approximation for the Cap-i?-Connected 
Subgraph problem. Here, we assume each = R; the extension to the case when requirements are 
"nearly uniform" is deferred to Appendix lB.il We start by writing a natural linear program relaxation 
for the problem; the integrality gap of this LP can be arbitrarily large. To deal with this, we introduce 
additional valid inequalities, called the knapsack cover inequalities, that must be satisfied by any 
integral solution. We show how to round this strengthened LP, obtaining an 0(log n)-approximation. 

2.1 The Standard LP Relaxation and Knapsack- Cover Inequalities 

We assume without any loss of generality that the capacity of any edge is at most R. For each subset 
S C 2 V , we use 5(S) to denote the set of edges with exactly one endpoint in S. For a set of edges 
A, we use u(A) to denote SeeA' u ( e )- We sa y * na * a se * °^ edges A satisfies (the cut induced by) S 
if u{A n S(S)) > R. Note that we wish to find the cheapest set of edges which satisfies every subset 
7^ S C V. The following is the LP relaxation of the standard integer program capturing the problem. 

mm ^c(e)x e (StdLP) 
VSQV, u(e)x e > R 

e£8(S) 

Ve 6 E, < x e < 1 
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The following example shows that ()Std LPp can have integrality gap as bad as R. 



Example 1: Consider a graph G on three vertices p, q, r. Edge pq has cost and capacity R; edge qr 
has cost and capacity R — l; and edge pr has cost C and capacity R. To achieve a global min-cut of 
size at least R, any integral solution must include edge pr, and hence must have cost C. In contrast, 
in ()Std LP|) one can set x pr = 1/R, and obtain a total cost of C/R. 

In the previous example, any integral solution in which the mincut separating r from {p, q} has size 
at least R must include edge pr, even if qr is selected. The following valid inequalities are introduced 
precisely to enforce this condition. More generally, let S be a set of vertices, and A be an arbitrary 
set of edges. Define R(S, A) = max{0, R — u(An5(S))} be the residual requirement of S that must be 
satisfied by edges in 5(S)\A. That is, any feasible solution has Ylee8(s)\A u ( e ) x e > R(S, A). However, 
any integral solution also satisfies the following stronger requirement 

min{R(S, A),u(e)}x e > R{S, A) 

e£S(S)\A 

and thus these inequalities can be added to the LP to strengthen it. These additional inequalities are 
referred to as Knapsack-Cover inequalities, or simply KC inequalities, and were first used by [6] in 
design of approximation algorithms for Cap-SNDP. 

Below, we write a LP relaxation, (IKC LPj ). strengthened with the knapsack cover inequalities. 
Note that the original constraints correspond to KC inequalities with A = 0; we simply write them 
explicitly for clarity. 

min Y c ( e ) x e ( KC LP ) 

e<£E 

VS* C V, u (e) x e > R (Original Constraints) 

e£6{S) 

Wl C£,VSCV, Y min(«(e), R(S, A))x e > R(S, A) (KC-inequalities) 

e<E6(S)\A 

Ve € E, < x e < 1 

The Linear Program (|KC LPp . like the original (|Std LP|) . has exponential size. However, unlike the 
()Std LP|) . we do not know of the existence of an efficient separation oracle for this. Nevertheless, as 
we show below, we do not need to solve (|KC LPp ; it suffices to get to what we call a good fractional 
solution. 

Definition 2.1. Given a fractional solution x, we say an edge e is nearly integral if x e > 40 i og n , and 
we say e is highly fractional otherwise. 

Definition 2.2. For any a > 1, a cut in a graph G with capacities on edges, is an a-mincut if its 

capacity is within a factor a of the minimum cut of G. 

Theorem 2.3. [Theorems 4.7.6 and 4.7.7 of [16J] The number of a-mincuts in an n-vertex graph is at 
most n 2a . Moreover, the set of all a-mincuts can be found in 0(n 2a log 2 n) time with high probability. 

Given a fractional solution x to the edges, we let A x denote the set of nearly integral edges, that 
is, A x := {e € E : x e > 4Q1 ^ gn }. Define u(e) = u(e)x e to be the fractional capacity on the edges. Let 
S := {S C V : u(5(S)) < 2R}. A solution x is called good if it satisfies the following three conditions: 

(a) The global mincut in G with capacity u is at least R, i.e. x satisfies the original constraints. 
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(b) The KC inequalities are satisfied for the set A x and the sets in S. Note that if (a) is satisfied, 
then by Theorem 12.31 |<S| < n 4 . 

(c) X^eG-E c ( e ) x e ls & t most the value of the optimum solution to (|KC LPp . 

Note that a good solution need not be feasible for ([KC LPp as it is required to satisfy only a subset 
of KC-inequalities. We use the ellipsoid method to get such a solution. Such a method was also used 
in ©. 

Lemma 2.4. There is a randomized algorithm that computes a good fractional solution with high 
probability. 

Proof: We start by guessing the optimum value M of (|KC LPp and add the constraint J2 e ^E c ( e ) x e < 
M to the constraints of (jKC LP|h If the guessed value is too small, a good solution may not exist; how- 
ever, a simple binary search suffices to identify the smallest feasible value of M. With this constraint 
in place, we will use the ellipsoid method to compute a solution that satisfies (a), (b), and (c) with 
high probability. Since we do not know of a polynomial-time separation oracle for KC inequalities, we 
will simulate a separation oracle that verifies condition (b), a subset of KC inequalities, in polynomial 
time. Specifically, we give a randomized polynomial time algorithm such that given a solution x that 
violates condition (b), the algorithm detects the violation with high probability and outputs a violated 
KC inequality. We now describe the entire process. 

Given a solution x we first check if condition (a) is satisfied. This can be done in polynomial time 
by 0(n) max-flow computations. If (a) is not satisfied, we have found a violated constraint. Once we 
have a solution that satisfies (a), we know that \S\ < n 4 . By Theorem 1 2. 3 j the set S can be computed 
in polynomial-time with high probability. Thus we can check condition (b) in polynomial-time, and 
with high-probability find a violating constraint for (b) if one exists. Once we have a solution that 
satisfies both (a) and (b), we check if YleeE c ( e ) x e < M. If not, we have once again found a violated 
constraint for input to the ellipsoid algorithm. Thus in polynomially many rounds, where each round 
runs in polynomial-time, the ellipsoid algorithm combined with the simulated separation oracle, either 
returns a solution x that satisfies (a), (b), and ^ egE c(e)x e < M, with high probability, or proves 
that the system is infeasible. Using binary search, we find the smallest M for which a solution x is 
returned satisfying conditions (a), (b) and ^ eg £ c(e)x e < M. Since M is less than the optimum value 
of (jKC LP p . we get that the returned x is a good fractional solution with high probability. □ 

2.2 The Rounding and Analysis 

Given a good fractional solution x, we now round it to get a O(logn) approximation to the Cap-i?- 
Connected Subgraph problem. A useful tool for our analysis is the following Chernoff bound (see [IB] , 
for instance): 

Lemma 2.5. Let X\, X2, ■ ■ ■ Xk be a collection of independent random variables in [0,1], let X = 
Xi, and let \x = E[X]. The probability that X < (1 — 5)fi is at most e~^ I 2 . 

We start by selecting A x , the set of all nearly integral edges. Henceforth, we lose the subscript and 
denote the set as simply A. Let F = E\A denote the set of all highly fractional edges; for each edge 
e € F, select it with probability (40 log n ■ x e ). Let F* C F denote the set of selected highly fractional 
edges. The algorithm returns the set of edges Ea ■= A U F*. 

It is easy to see that the expected cost of this solution Ea is 0(log n) ^2 e( z E c(e)x e , and hence 
by condition (c) above, within O(logn) times that of the optimal integral solution. Thus, to prove 
Theorem II. 1[ it suffices to prove that with high probability, Ea satisfies every cut in the graph G; 
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we devote the rest of the section to this proof. We do this by separately considering cuts of different 
capacities, where the capacities are w.r.t u (recall that u(e) = u(e)x e ). Let C be the set of cuts of 
capacity at least 2R, that is, C := {S C V : u(5(S)) > 2R}. 

Lemma 2.6. Pr[ VS € C : u(E A n 5(S)) > R] > 1 - ^nj. 

Proof: We partition C into sets £ 2 ,£ 3 , • • • where Cj := {S C V : jR < u(5(S)) < (j + 1)R}. Note 
that Theorem 12,31 implies \CA < n 2 ^ +1 ) by condition (a) above. Fix j, and consider an arbitrary cut 
S G Cj. If u(A PI 5(S)) > R, then S is clearly satisfied by E A - Otherwise, since the total ^-capacity of 
S is at least jR, we have u(F n -5(5)) > u(5(S)) - u{A n -5(5)) > (j - l)i?. Thus 

eGFn<5(5) 

Recall that an edge e € F is selected in F* with probability (40 log n • x e ). Thus, for the cut S, the 
expected value of ^2 e& p* n s(s) — ~~ 1) l°§ n - Since u(e)/R < 1, we can apply Lemma [231 to get 
that the probability that S is not satisfied is at most e _161ogn ^ -1 ^ = 1/n 16 ^ -1 '. Applying the union 
bound, the probability that there exists a cut in Cj not satisfied by E A is at most n 2 ^ +1 )/n 16 ^ -1 ^ = 
n ~ 3 . Thus probability that some cut in C is not satisfied is bounded by ^j>2 nl8_14j — 2n~ 10 if 
n > 2. Hence with probability at least 1 — l/2n 10 , AU F* satisfies all cuts in C. □ 

One might naturally attempt the same approach for the cuts in <S (recall that S = {S C V : 
u(5(S)) < 2i?}) modified as follows. Consider any cut S, which is partly satisfied by the nearly 
integral edges A. The fractional edges contribute to the residual requirement of S, and since x e is 
scaled up for fractional edges by a factor of 40 log n, one might expect that F* satisfies the residual 
requirement, with the logn factor providing a high-probability guarantee. This intuition is correct, 
but the KC inequalities are crucial. Consider Example 1; edge pr is unlikely to be selected, even after 
scaling. In the statement of Lemma [2 .5[ it is important that each random variable takes values in [0, 1]; 
thus, to use this lemma, we need the expected capacity from fractional edges to be large compared to 
the maximum capacity of an individual edge. But the KC inequalities, in which edge capacities are 
"reduced" , enforce precisely this condition. Thus we get the following lemma using a similar analysis 
as above. 

Lemma 2.7. Pr[ VS" G S : u(5(E A U 5{S))) > R] > l-^r- 

The 0(logn)-approximation guarantee for the Cap-i?-Connected Subgraph problem stated in The- 
orem [LTJ follows from the previous two lemmas. 

2.3 The A;- Way— 7^-Connected Subgraph Problem 

The fc-Way-7£-Connected Subgraph problem that we define is a natural generalization of the well- 
studied min-cost A-edge-connected subgraph problem. The latter problem is motivated by applica- 
tions to fault-tolerant network design where any A — 1 edge failures should not disconnect the graph. 
However, there may be situations in which global A-connectivity may be too expensive or infeasible. 
For example the underlying graph G may have a single cut-edge but we still wish a subgraph that 
is as close to 2-edge-connected as possible. We could model the requirement by /c-Way-7£-Connected 
Subgraph (in the unit-capacity case) by setting R\ = 1 and R2 = 3; that is, at least 3 edges have to 
be removed to partition the graph into 3 disconnected pieces. 

We briefly sketch the proof of Theorem 11.21 We work with a generalization of (|KC LPp to i-way 
cuts, with an original constraint for each i + 1-way cut, 1 < i < k — 1, and with KC inequalities added. 
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The algorithm is to select all nearly integral edges e (those with x e > 40fc log n ) , and select each of the 
remaining (highly fractional) edges e with probability 40A; log n ■ x e . The analysis is very similar to 
that of Theorem 11.11 and hence moved to the appendix; we use the following lemma on counting A;- way 
cuts in place of Theorem 12.31 

Lemma 2.8 (Lemma 11.2.1 of |16j). In an n-vertex graph, the number of k-way cuts with capacity at 
most a times that of a minimum k-way cut is at most n 2a i k - 1 ) . 

It would be interesting to explore algorithms and techniques for other more general variants of the 
k- Way-Tv!.- Connected Subgraph problem that we consider here. 

3 Single-Pair Cap-SNDP 

In this section we show that the integrality gap with KC inequalities is f2(n) even for single-pair 
Cap-SNDP in undirected graphs. Moreover, when the underlying graph is directed, we show that the 
single-pair problem is hard to approximate to within a factor of 2 log 1 * n for any 5 > 0. 

3.1 Integrality Gap with KC Inequalities 

We show that for any positive integer R, there exists a single-pair Cap-SNDP instance G with (R + 2) 
vertices such that the integrality gap of the natural LP relaxation strengthened with KC inequalities is 
£l(R). The instance G consists of a source vertex s, a sink vertex t, and R other vertices vi,V2, • • • , vr. 

There is an edge of capacity 2 and cost 1 (call these small edges) 
between s and each Vi, and an edge of capacity R and cost R between 
each Vi and t (large edges). We have R s t = R. Clearly, an optimal 
integral solution must select at least R/2 of the large edges (in addition 
to small edges), and hence has cost greater than R 2 /2. The instance is 
depicted in the accompanying figure: Label (u, c) on an edge denotes 
capacity u and cost c. 

We now describe a feasible LP solution: set x e = 1 on each small 
edge e, and x e < = 2/R on each large edge e'. The cost of this solution 
is R from the small edges, and 2R from the large edges, for a total of 
3R. This is a factor of R/6 smaller than the optimal integral solution, 
proving the desired integrality gap. 

It remains only to verify that this is indeed a feasible solution to (|KC LPp . Consider the constraint 
corresponding to sets S, A. As edges in A \ 5(S) play no role, we may assume A C 5(S). If A includes 
a large edge, or at least R/2 small edges, the residual requirement R(S, A) that must be satisfied by 
the remaining edges of 6(S) is 0, and so the constraint is trivially satisfied. Let A consist of a < R/2 
small edges; the residual requirement is thus R — 2a. Let S(S) contain % large edges and thus R — i 
small edges. Now, the contribution to the left side of the constraint from small edges in 6(S) \ A is 
2(R — i — a) = (R — 2a) + (R — 2i). Therefore, the residual requirement is satisfied by small edges alone 
unless i > R/2. But the contribution of large edges is i ■ j| • (R — 2a) which is greater than R — 2a 
whenever i > R/2. Thus, we satisfy each of the added KC inequalities. 

3.2 Hardness of Approximation in Directed Graphs 

We now prove Theorem 11.41 via a reduction from the label cover problem [J]. 
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Definition 3.1 (Label Cover Problem). The input consists of a bipartite graph G(A\J B,E) such 
that the degree of every vertex in A is oIa and degree of every vertex in B is ds, a set of labels La 
and a set of labels Lb, and a relation 7T( a ,&) Q La X Lb for each edge (a, 6) £ E. Given a labeling 
(f) : AU B — )• La U Lb, an edge e = (a,b) £ E is said to be consistent iff (4>(a), 4>(b)) £ 7T(a,b)- The goal 
is to find a labeling that maximizes the fraction of consistent edges. 

The following hardness result for the label-cover problem is a well-known consequence of the PCP 
theorem [3] and Raz's Parallel Repetition theorem |20| . 

Theorem 3.2 ([3 [20]). For any e > 0, there does not exist a poly-time algorithm to decide if a given 
instance of label cover problem has a labeling where all edges are consistent (Yes-Instance), or if no 
labeling can make at least i fraction of edges to be consistent for 7 = 2 log e ™ (No-Instance), unless 
NP C DTLM £(n polylog(n ) ) . 

We now give a reduction from label cover to the single-pair Cap-SNDP in directed graphs. In our 
reduction, the only non-zero capacity values will be 1, dA, and ds- We note that Theorem 13.21 holds 
even when we restrict to instances with dA = c?_b- Thus our hardness result will hold on single-pair 
Cap-SNDP instances where there are only two distinct non-zero capacity values. 

Given an instance / of the label cover problem with m edges, we create in polynomial-time a 
directed instance V of single-pair Cap-SNDP such that if I is a Yes-Instance then /' has a solution 
of cost at most 2m, and otherwise, every solution to V has cost 0(7717^). This establishes Theorem 
11.41 when we choose e = 5/2. 

The underlying graph G'(V',E') for the single-pair Cap-SNDP instance is constructed as follows. 
The set V contains a vertex v for every v £ AL) B. We slightly abuse notation and refer to these sets 
of vertices in V' as A and B as well. Furthermore, for every vertex a £ A, and for every label I £ La, 
the set V contains a vertex a(£). Similarly, for every vertex b £ B, and for every label £ € Lb, the 
set V contains a vertex b(i). Finally, V contains a source vertex s and a sink vertex t. The set E' 
contains the following directed edges: 

• For each vertex a in A, there is an edge from s to the vertex a of cost and capacity dA- For 
each vertex b £ B, there is an edge from b to t of cost and capacity ds- 

• For each vertex a £ A, and for all labels I in La, there is an edge from a to a(£) of cost dA and 
capacity dA- For each vertex b £ B, and for all labels £ in Lb, there is an edge from b(£) to b of 
cost ds and capacity ds- These two types of edges are the only edges with non-zero cost. 

• For every edge (a, b) £ E, and for every pair of labels (£ a ,£b) £ 7T(a,&)> there is an edge from a(£ a ) 
to b(£b) of cost and capacity 1. 

This completes the description of the network G' . The requirement R s t between s and t is m, the 
number of edges in the label cover instance. It is easy to verify that the size of the graph G' is at most 
quadratic in the size of the label cover instance, and that G' can be constructed in polynomial-time. 
The lemmas below analyze the cost of Yes-Instance and No-Instance instances. 

Lemma 3.3. If the label cover instance is a Yes-Instance, then G' contains a subgraph of cost 2m 
which can realize a flow of value m from s to t. 

Proof:Let 4> be any labeling that consistently labels all edges in G(AL)B, E). Also, let E\ C E' be the 
set of all edges of cost in E' , and let E 2 C E' be the set of edges {(a, a(4>(a))) \ a £ A}U{(b((f)(b)),b) : 
b £ B}. We claim that E\ UE2 is a feasible solution for the single-pair Cap-SNDP instance. Note that 
the total cost of edges in E\ U E2 is |-A[dj4 + \B\ds = 2m. We now exhibit a flow of value m from s to 
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t in G"(V', Ei U E2). A flow of value (La is sent along the path s — > a — > a(4>(a)) for all a G A. From 
a((p(a)), a unit of flow is sent to the d\ vertices of the form {&(</>(&)) | b G B and (a, b) G -E}; this is 
feasible because 4> consistently labels all edges in E. Thus each vertex of the form b(4>(b)) where b G B 
receives ds units of flow, since the degree of b is ds in G. A flow of value ds is sent to t along the 
path b((f>(b)) -4 6 -4 t. Thus s sends out a flow of value |>1|gLi = m, or equivalently, t receives a flow 
of value \B\d,B = m. □ 

Lemma 3.4. // the label cover instance is a No-Instance, then any subgraph of G' that realizes a 
flow of m units from s to t has cost 0(77774). 

Proof: Let p = 7 1 / 4 /2, and M = 32/15. Assume by way of contradiction, that there exists a subgraph 
G"(V' , E") of G' of cost strictly less than ^ that realizes m units of flow from s to t. We say a vertex 
a G A is light if the number of edges of the form {(a, a(£)) \ £ G La} in G" is less than p. Similarly, we 
say a vertex b G B is if the number of edges of the from {(b(£), b) \ £ £ Lb} in G" is less than /?. 
All other vertices iniUB are referred to as heavy vertices. Note that at most 1/M fraction of vertices 
in A could be heavy, for otherwise the total cost of the edges in E" would exceed ^ ■ p ■ d\ = 
Similarly, at most 1/M fraction of vertices in B could be heavy. 

Now fix any integral s-t flow / of value m in G"; an integral flow exists since all capacities are 
integers. We start by deleting from G" all heavy vertices. Since at most 1/M fraction of either A or B 
are deleted, the total residual flow in this network is at least (1 — jj)m = f| (recall that M = 32/15) 
since at most units of flow can transit through a vertex in A, and at most ds units of flow can 
transit through a vertex in B. 

Let F be a decomposition of the residual flow into unit flow paths. Note that \F\ = m/16. By 
construction of G', every flow path / G F is of the form s -4 a -4 £ a — > 4 — s> b -4 t where the pair 
(£ a , £b) G 7T( a m . We say that an edge (a, b) G E is a good edge if there is a flow path / of the above form, 
and we say / is a certificate for edge (a, b) being good. Note that every flow path / is a certificate of 
exactly one edge (a, b). We claim that there are at least good edges in G. It suffices to show that 
for any edge (a, b) G E, at most p 2 flow paths in F can certify that (a, b) as a good edge. Since a and 
6 are both light vertices, we know that \{(a,£ a ) \ £ a G La} PiE"\ < p and &) I 4 G L^} flE 1 "! < p. 
Now using the fact that each edge (£ a ,£b) has unit capacity, it follows that at most p 2 paths in F can 
certify (a, b) as a good edge. Hence number of good edges in E is at least 

We now show existence of a labeling 4> that makes at least ^ fraction of the edges to be consistent, 
contradicting that we were given a No-Instance of label cover. For a vertex a G A, let T(a) := {£ a G 
Lyi I (a,£ a ) G -E"}. Similarly, we define T(b) for each vertex b £ B. Consider the following random 
label assignment: each vertex a G A is assigned uniformly at random a label from T(a), and each 
vertex in B is assigned uniformly at random a label in T(b). For any good edge (a, 6), the probability 
that the random labeling makes it consistent is at least 4y since |T(a) | and |r(6)| are both less than 
p (as a and 6 are light), and there exists an £ a G Ta and 4 G Tb such that (£ a ,£b) £ ^(a.ft)- Thus, in 
expectation, at least fraction of good edges are made consistent by the random assignment. Hence 
there exists a labeling cf> that -g?^ = ™ edges in G consistent. □ 

Since the graph G' can be constructed from G in poly-time, it follows that a poly-time (7 1 ^ 4 /5)- 
approximation algorithm for single-pair Cap-SNDP would give a poly-time algorithm to decide whether 
a given instance of label cover is a Yes-Instance or a No-Instance. 
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4 Cap-SNDP with Multiple Copies Allowed 



We now consider the version of Cap-SNDP when multiple copies of any edge e can be chosen; that is, 
for any integer a > 0, a copies of e can be bought at a cost a ■ c(e) to obtain a capacity of a ■ u(e). 
Allowing multiple copies makes the problem easier, and Goemans et al. [H] give a 0(log R max ) factor 
approximation algorithm for the problem. In this section, we complement this result with a 0(logk) 
factor approximation algorithm, where k is the number of (i,j) pairs with Rij >O0 Our algorithm is 
inspired by the work of Berman and Coulston [5 J on online Steiner Forest. For notational convenience, 
we rename the pairs (si,t\), • • • , (sk,tk), and denote the requirement R Si ,u as Ri; the vertices Si,U 
are referred to as terminals. We also assume that the pairs are so ordered that R\ > R2 > ■ ■ ■ > Rk- 

We first give an intuitive overview of the algorithm. The algorithm considers the pairs in decreasing 
order of requirements, and maintains a forest solution connecting the pairs that have been already 
been processed; that is, if we retain a single copy of each edge in the partial solution constructed so 
far, we obtain a forest F. For any edge e on the path in F between sj and tj, the total capacity 
of copies of e will be at least Rj. When considering S{,ti, we connect them as cheaply as possible, 
assuming that edges previously selected for F have cost. (Note that this can be done since we are 
processing the pairs in decreasing order of requirements and for each edge already present in F, the 
capacity of its copies is at least Ri.) The key step of the algorithm is that in addition to connecting 
Si and ti, we also connect the pair to certain other components of F that are "nearby". The cost 
of these additional connections can be bounded by the cost of the direct connection costs between 
the pairs. These additional connections are useful in allowing subsequent pairs of terminals to be 
connected cheaply. In particular, they allow us to prove a 0(log k) upper bound on the approximation 
factor. 

We now describe the algorithm in more detail. The algorithm maintains a forest F of edges that 
have already been bought; F satisfies the invariant that, after iteration i — 1, for each j < i — 1, F 
contains a unique path between Sj and tj. In iteration i, we consider the pair Sj, ij. We define the cost 
function Cj(e) as Cj(e) := for edges e already in F, and Cj(e) := c(e) + ^yc(e), for edges e ^ F. Note 
that for an edge e ^ F, the cost Cj(e) is sufficient to buy enough copies of e to achieve a total capacity 
of Ri. Thus it suffices to connect Sj and U and pay cost Cj(e) for each edge; in the Cap-SNDP solution 
we would pay at most this cost and get a feasible solution. However, recall that our algorithm also 
connects s, and ti to other "close" components; to describe this process, we introduce some notation: 

For any vertices p and q, we use di(p, q) to denote the distance between p and q according to the 
metric given by edge costs q(e). We let li := di(si,ti) be the cost required to connect Sj and ti, given 
the current solution F. We also define the class of a pair (sj,tj), and of a component: 

• For each j < i, we say that pair (sj,tj) is in class h if 2 h < £j < 2 h+l . 
Equivalently, class(j) = [log €j J . 

• For each connected component X of F, class(X) = max^. ^^jf class(j). 

Now, the algorithm connects Sj (respectively ti) to component X if di(si,X) (resp. di(ti,X)) < 
2mm{ciass(i),ciass(x)}^ ^hat is, if X is close to the pair (sj,tj) compared to the classes they are in, we 
connect X to the pair. As we show in the analysis, this extra connection cost can be charged to some 
pair (sj,tj) in the component X. The complete algorithm description is given below. 

3 Note that we overload the letter 'fc', previously used in the definition of the fc-Way-7?.-Connected Subgraph problem; 
this should cause no ambiguity as we discuss only pairwise connectivity requirements in this section. 
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Cap-SNDP-MC : 

F <— {(F is the forest solution returned)) 

For i <— 1 to k 

For each edge e G F, Cj(e) ^— 

For each edge e ^ F, Cj(e) «— c(e) + (Ri/u(e))c(e) 

Add to F a shortest path (of length ti) from Sj to t% under distances Cj(e) 
class(i) <— [log^jj 

For each connected component X of F 

If di(Si,X) < 2 min {dass(i),class(X)} 

Add toFa shortest path connecting Sj and X 
For each connected component X of F 

If di(U,X) < 2 min {class(i),class(X)} 

Add toFa shortest path connecting ti and X 
Buy \Ri/u e ~] copies of each edge e added during this iteration. 



We prove that this algorithm Cap-SNDP-MC gives an O(logfc) approximation. 



The structure of our proof is as follows: Recall that li was the direct connection cost between Si and ti\ 
in addition to paying ti to connect these vertices, the algorithm also buys additional edges connecting Si 
and ti to existing components. We first show (in Lemma [4. ip that the total cost of extra edges bought 
can be charged to the direct connection costs; thus, it suffices to show that Yli^i ^ 0(logfc)OPT, 
where OPT is the cost of an optimal solution. To prove this (Xemma l4.2p . we bucket the pairs (si,ti) 
into 0(logk) groups based on class(i), and show that in each bucket h, Y^i:ciaaa(i)=h ^ — O(OPT). 

Lemma 4.1. The total cost of all edges bought by Cap-SNDP-MC is at most 9 Yli=i ^i- 

Proof: Let Fi denote the set of edges added to F during iteration i. First, note the total cost paid 
for copies of edge e G Fi is [~^§y]c(e) < c(e) + ^c(e) = q(e). Thus, it suffices to show: 

k k 
i=l eS-Fi i=l 

We prove that the total cost of the additional edges bought is at most 8 X^f=i > this clearly implies 
the desired inequality. It is not true that for each i, the total cost of additional edges bought during 
iteration i is at most 8£i. Nonetheless, a careful charging scheme proves the needed bound on total 
cost. In iteration i, suppose we connect the pair (sj,tj) to the components X%, . . . ,X r . We charge the 
cost of connecting (si,ti) and component Xj to the connection cost lj of a pair (sj,tj) in Xj. This is 
possible since we know the additional connection cost is at most 2 class (^). Care is required to ensure 
no pair is overcharged. To do so, we introduce some notation. 

At any point during the execution of the algorithm, for any current component X of F, we let 
Leader(X) be a pair (si,ti) G X such that class(i) = class(X). For integers h < class(A), 
/i-Leader(X) will denote a pair (sj,tj) in X; we explain how this pair is chosen later. (Initially, 
/i-Leader(X) is undefined for each component X.) 

Now, we have to account for additional edges bought during iteration i; these are edges on a shortest 
path connecting Sj (or ti) to some other component X; we assume w.l.o.g. that the path is from Sj 
to X. Consider any such path P connecting Si to a component X; we have ^ e eP c *( e ) = di(si,X) < 
2 min{ciass(i),ciass(X)}_ Let h _ [_i og di{s u X)\: Charge all edges on this path to /i-Leader(X) if it is 
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defined; otherwise, charge all edges on the path to Leader(X). In either case, the pair (sj, ti) becomes 
the ^-Leader of the new component just formed. Note that a pair (si,ti) could simultaneously be 
the /ii-Leader, /12-Leader, etc. for a component X if (sj,tj) connected to many components during 
iteration i. However, it can never be the /i-Leader of a component for h > class(i), and once it has 
been charged as /i-Leader, it is never charged again as /i-Leader. Also observe that if a pair is in a 
component X whose /i-Leader is defined, subsequently, it always stays in a component in which the 
/i-Leader is defined. 

For any i, we claim that the total charge to pair (sj, ti) is at most 8£i, which completes the proof. 
Consider any such pair: any charges to the pair occur when it is either Leader or /i-Leader of its 
current component. First, consider charges to (si,ti) as Leader of a component. Such a charge can 
only occur when connecting some Sj (or tj) to X. Furthermore, if h = [log dj(s j, X)\ < class(A) = 
class(i), the /i-Leader(A) must be currently undefined, for otherwise the h-Leader(X) would have 
been charged. Subsequently, the /i-Leader of the component containing (sj,^) is always defined, and 
so (sj, ti) will never again be charged as a Leader(A) by a path of length in [2 h , 2 h+1 ). Therefore, the 
total charge to (sj,ij) as Leader of a component is at most Y^h^i^ 2 h+1 < 2 class M+ 2 < 4^. 

Finally, consider charges to (sj,tj) as /i-Leader of a component. As observed above, h < class(i). 
Also for a fixed h, a pair is charged at most once as /i-Leader. Since the total cost charged to 
(si,ti) as /i-Leader is at most 2 h+1 ; summing over all h < class(i), the total charge is less than 

2Class(i)+2 _ 

Thus, the total charge to (sj,tj) is at most 4£i + 4£i = 8£i, completing the proof. □ 

Lemma 4.2. If OPT denotes the cost of an optimal solution to the instance of Cap-SNDP with 
multiple copies, then Ya=\^ < 64(|Togfcl + l)OPT. 

Proof: Let Ch denote J2i-ci&ss(i)=h ^- Clearly, Yli=i^i = J2h^h- The lemma follows from the two 
sub-claims below: 

Sub-Claim 1: J2 h C h < ( 2 ( [log *:] + 1)) ■ ma X/l C h 
Sub-Claim 2: For each h, C h < 320PT. 

Proof of Sub-Claim 1: Let h! = maxj class(z). We have Ch' > 2 h> , and for any terminal i such 

that class(i) < h! — ([log A;] + 1), we have £i < 2 ^ ■ Thus, the total contribution from such classes 

is at most %r ■ k = 2 h ' , and hence: 

k ' h' 



Ch > — ^ — , which implies 

h=h' ~\log k] 

max Ch > —-F, — ^ 

-l\ogk]<h<h' 2 ( | log fc | + 1) 

□ 



It remains to show Sub-Claim 2, that for each h, Ch < 320PT. Fix h. Let Sh denote the set of 
pairs Si, ti such that class(i) = h. Our proof will go via the natural primal and dual relaxations for the 
Cap-SNDP problem. In particular, we will exhibit a solution to the dual relaxation of cost C/j/32. To 
do so we will require the following claim. Define ball(sj,r), a ball of radius r around s« as containing 
the set of vertices v such that di(si, v) < r and the set of edges e = uv such that di{si, {u, v})+Ci(e) < r. 
An edge e is partially within the ball if di(si, {u, v}) < r < di(si, {u, v}) + Cj(e). Subsequently, we 
assume for ease of exposition that no edges are partially contained within the balls we consider; this 
can be achieved by subdividing the edges as necessary. Similarly, we define ball(tj,r), the ball of 
radius r around tj. Two balls are said to be disjoint if they contain no common vertices. 
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Claim 4.3. There exists a subset of pairs, S' h C Sh, \S' h \ > \Sh\/2, and a collection of \S' h \ disjoint 
balls of radius 2 h /A centred around either Si or ti, for every pair (sj,tj) € S' h . 

We prove this claim later; we now use it to complete the proof of Sub-Claim 2. First we describe 
the LP. Let the variable x e denote whether or not edge e is in the Cap-SNDP solution. Let Vi be the 
set of paths from Si to ti. For each P £ Vi, variable fp denotes how much flow t sends to the root 
along path P. We use iij(e) to refer to min{i?j, u(e)}, the effective capacity of edge e for pair (si,ti). 



Primal min c e 



(V* £ [k]) 



Pev t 



fp < Ui(e)x e (Vie[k],eeE) 

P£Vt\eeP 

X e ,fp > 



Dual max RjCtj 
ter 

y~]uj{e)Pi, e < c e (Ve G E) 



on < J2fa,e (Vie[k],PeVi 

eGP 



We now describe a feasible dual solution of value at least C^/32 using Claim [4T3l For (sj,ij) € S' h , 
if there is a ball B around Sj (or equivalently ti), we define ^ e = c(e)/iij(e) for each edge in the ball. 
Since the balls are disjoint, the first inequality of the dual is clearly satisfied. Set on = 2 h /8Ri. For 
any path P G Vi, we have 



eeP 



1 x - Rjc(e) 
Ri ^ uAe) 



>— y 



1 2 



Ric(e) , . 1 v—v , . 

+ c(e) > > Ci(e)> 

u(e) K ' ~ 2R { ^ lK ' ~ 2Ri 4 

1 eePHB y ' 1 e&PnB 1 



h 



a; 



where the first inequality used Ui(e) < Ri, the second follows from the definition of Cj(e), and the last 
inequality follows from the definition of ball(sj, 2 h /4). Thus, on = 2 h /8Ri, is feasible along with these 
Pi^'s. This gives a total dual value of 



1511 > 
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IS/,. I > 



32 ^ 



where the last inequality follows from the fact that class(i) 
Claim 14.31 which we now prove. 



Ch 
' 32 

h. This proves the lemma modulo 



Proof of Claim 14.3b We process the pairs in Sh in the order they are processed by the original 
algorithm and grow the balls. We abuse notation and suppose these pairs are (si,ii), • • • , (s p ,t p ). We 
maintain a collection of disjoint balls of radius r = 2 h /4, initially empty. 

At stage i, we try to grow a ball of radius r around either Sj or £j. If this is not possible, the ball 
around intersects that around some previous terminal in S' h , say sf, similarly, the ball around ti 
intersects that of a previous terminal, say tg. Let v be a vertex in ball(sj, r) and ball(sj, r). We have 
di(si,Sj) < di{si,v) +di{v,Sj) < di{si,v) +dj(v,Sj) < 2 h /2. (The second inequality follows because 
for any j < i and any edge e, Cj(e) < Cj(e).) Similarly, we have ck(ti,tg) < 2 h /2. 

Now, we observe that Sj and t£ could not have been in the same component of F at the beginning 
of iteration i of Cap-SNDP-MC; otherwise di(si,ti) < di(si,Sj) + di(ti,t£) < 2 h , contradicting that 
class(i) = h. But since di(si,Sj) < 2^/2 and class(z) = class(j) = h, we connect Sj to the 
component of Sj during iteration i; likewise, we connect ti to the component of tg during this iteration. 
Hence, at the end of the iteration, Si,ti,Sj,ti are all in the same component. As a result, the number 
of components of F containing pairs of Sh decreases by at least one during the iteration. 
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It is now easy to complete the proof: During any iteration of F corresponding to a pair (sj, ij) £ Sh, 
the number of components of F containing pairs of can go up by at most one. Say that an iteration 
succeeds if we can grow a ball of radius r around either Sj or ti, and fails otherwise. During any 
iteration that fails, the number of components decreases by at least one; as the number of components 
is always non-negative, the number of iterations which fail is no more than the number which succeed. 
That is, \S' h \ > \S h - S' h \. □ 

□ 

Theorem 11.51 is now a straightforward consequence of Lemmas 14.11 and 14.21 
Proof of Theorem II. 5 1 The total cost of edges bought by the algorithm is at most Yli=i YleeF c «( e ) — 
9 Y!l=i 4 by LemmaSU But Ym=i h < 64 ( [ lo g k ~\ + l)OPT, by LemmaEZl and hence the total cost 
paid by Cap-SNDP-MC is at most 0(log/c)OPT. □ 

5 Conclusions 

In this paper we made progress on addressing the approximability of Cap-SNDP. We gave an O(logn) 
approximation for the Cap-i2-Connected Subgraph problem, which is a capacitated generalization of 
the well-studied min-cost A-edge-connected subgraph problem. Can we improve this to obtain an 
0(1) approximation or prove super-constant factor hardness of approximation? We also highlighted 
the difficulty of Cap-SNDP by focusing on the single pair problem, and showing both super-constant 
hardness and an £l(n) integrality gap example, even for the LP with KC inequalities. We believe that 
understanding the single pair problem is the key to understanding the general case. In particular, we 
do not have a non-trivial algorithm even for instances in which the edge capacities are either 1 or U; 
this appears to capture much of the difficulty of the general problem. As we noted, allowing multiple 
copies of edges makes the problem easier; in practice, however, it may be desirable to not allow too 
many copies of an edge to be used. It is therefore of interest to examine the approximability of Cap- 
SNDP if we allow only a small number of copies of an edge. Does the problem admit a non-trivial 
approximation if we allow O(l) copies or, say, O(logn) copies? This investigation may further serve 
to delineate the easy versus difficult cases of Cap-SNDP. 

Acknowledgements: CC's interest in capacitated network design was inspired by questions from 
Matthew Andrews. He thanks Mathew Andrews and Lisa Zhang for several useful discussions on their 
work on capacitated network design for multi-commodity flows. 
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A Hardness of Approximation for Cap-SNDP in Undirected Graphs 



In this section, we prove Theorem 11,61 via a reduction from the Priority Steiner Tree problem. In the 
Priority Steiner Tree problem, the input is an undirected graph G(V, E) with a cost c(e) and a priority 
-P(e) E {1, 2, . . . , k} for each edge e. (We assume k is the highest and 1 the lowest priority.) We are 
also given a root r and a set of terminals T C V — {r}; each terminal t £ T has a desired priority 
P{t). The goal is to find a minimum-cost Steiner Tree in which the unique path from each terminal t 
to the root consists only of edges of priority P(t) or higher@ 

Chuzhoy et al. [9] showed that one cannot approximate the Priority Steiner Tree problem within 
a factor better than ri(loglogn) unless NP C DTIME(n} ogloglogn ), even when all edge costs are 
or 1. Here, we show an approximation-preserving reduction from this problem to Cap-SNDP with 
multiple copies; this also applies to the basic Cap-SNDP problem, as the copies of edges do not play 
a significant role in the reduction. 

Given an instance X ps t of Priority Steiner Tree on graph G(V,E) with edge costs in {0,1}, we 
construct an instance Z cap of Cap-SNDP defined on the graph G as the underlying graph. Fix R to 
be any integer greater than 2m? where m is the number of edges in the graph G. We now assign a 
capacity of u(e) = R l to each edge e with priority P{e) = i in X pst . Each edge e of cost in I pst has 
cost c(e) = 1 in T cap , and each edge e of cost 1 in l pst has cost c(e) = m 2 in T cap . Finally, for each 
terminal i, set Rt r = R 1 if P(t) = i; for every other pair of vertices (p, q), R pq = 0. 

Let C denotes the cost of an optimal solution to I ps t, note that C < m\ we now argue that 
X ps t has an optimal solution of cost C iff T cap has an optimal solution of of cost between Cm? and 
Cm 2 + m < (C + l)m 2 . Given a solution E* to X pst of cost C, simply select the same edges for T cap \ 
the cost in X cap is at most Cm 2 + m since in T cap , we pay 1 for each edge in E* that has cost in I ps t- 
This is clearly a feasible solution to X cap as each terminal t has a path to r in E* containing only edges 
with priority at least P(t), which is equivalent to having capacity at least Rt r - Conversely, given a 
solution E' to T cap with cost in [Cm 2 , (C + l)m 2 ), select a single copy of each edge in E' as a solution 
to 

I ps t] clearly the total cost is at most C. To see that this is a feasible solution, suppose that E 1 did 
not contain a path from some terminal t to the root r using edges of priority P(t) or more. Then there 
must be a cut separating t from r in which all edges of E' have capacity at most R p ^ t ' 1 . But since 
E' supports a flow of R p ^ from t to r, it must use at least R edges (counting with multiplicity); this 
implies that the cost of E' is at least R> (C + l)m 2 , a contradiction. 

We remark that a similar reduction also proves fi(loglogn) hardness for the single-pair Cap-SNDP 
problem without multiple copies: One can effectively encode an instance of the single-source Fixed- 
Charge Network Flow (FCNF, [9]), very similar to single-source Cap-SNDP with multiple copies, as 
an instance of single-pair Cap-SNDP without multiple copies: Create a new sink t*, and connect t* 
to each original terminal t with a single edge of cost and capacity Rt r - The only way to send flow 
^2 te x Rtr flow from t* to the source s is for each terminal t to send Rt r to s. Thus, Theorem 11.31 is a 
simple consequence of the £7 (log log n) hardness for single-source FCNF |9j. 

4 It is easy to see that a minimum-cost subgraph containing such a path for each terminal is a tree; given any cycle, 
one can remove the edge of lowest priority. 
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B Omitted Proofs 



B.l Proof of Theorem ED Near Uniform Cap-SNDP 

The algorithm described in Section [2] can be extended to the case where requirements are nearly 
uniform, that is, if R pq £ [R,~fR] for all pairs (p,q) G V x V. We obtain an 0(7logn)-approximation, 
while increasing the running time by a factor of 0(n 47 ). We work with a similar LP relaxation; for 
each set S C 2 V , we use R(S) = max. p£ s,q^s{Rpq} to denote the requirement of S. Now, the original 
constraints are of the form 

u{e)x e > R{S) 

eES(S) 

for each set S, and we define the residual requirement for a set as R(S, A) = min{0, R(S)—u(A(l5(S))}. 
The KC inequalities use this new definition of R(S, A). 

Given a fractional solution x to the KC LP, we modify the definitions of highly fractional and 
nearly integral edges: An edge e is said to be nearly integral if x e > 407 | og n , and highly fractional 
otherwise. Again, for a fractional solution x, we let A x denote the set of nearly integral edges; the set 
S of small cuts is now {S C V: u(S(S)) < 2~/R}. From the cut-counting theorem, |«S| < n 4 " 7 . We use 
C to denote the set of large cuts, the sets {S C V : u(5(S)) > 2jR}. 

As before, a fractional solution x is good if the original constraints are satisfied, and the KC 
Inequalities are satisfied for the set of edges A x and the sets in S. These constraints can be checked in 
time 0{n A " f+2 log 2 n), so following the proof of Lemma EH for constant 7, we can find a good fractional 
solution in polynomial time. 

The rounding and analysis proceed precisely as before: For each highly fractional edge e, we select 
it for the final solution with probability 407logra ■ x e . The expected cost of this solution is at most 
0(7 log n) times that of the optimal integral solution, and analogously to the proofs of Lemmas 12.61 
and 12. 7( one can show that the solution satisfies all cuts with high probability. This completes the 
proof of Theorem 11.11 



B.2 Proof of Theorem Q 

To prove Theorem 11.21 we work with the generalization of (|KC LP| ) given below. For any z-way cut C 
and for any set of edges A, we use R(C, A) to be max{0, Ri — u(A n d(C) }H 

min c(e)x e (/c-way KC LP) 

e&E 

Vi, Vi-way cuts C, u(e)x e > Ri (Original Constraints) 

ee<5(C) 

MA C E,Vi,Vi-way cuts C, ^ min{u(e), R(C, A)}x e > R(C, A) (KC-inequalities) 

ee<5(C)\A 

Ve € E, < x e < 1 



As before, given a fractional solution x to this LP, we define A x (the set of nearly integral edges) 
to be {e € E: x e > 40k \ ogn }- Define u(e) = u(e)x e to be the fractional capacity on the edges. Let 

5 For ease of notation, we assume that for any edge e, u(e) < Ri. This is not without loss of generality, but the proof 
can be trivially generalized: In the constraint for each i + 1-way cut C such that e £ 5(C), simply use the minimum of 
u(e) and Ri. 
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<Sj := {C : C is an i + 1-way cut and u(5(C)) < 2R{\. The solution x is said to be good if it satisfies 
the following three conditions: 

(a) If the capacity of e is u(e), the capacity of any i + 1-way cut in G is at least Ri] equivalently x 
satisfies the original constraints. 

(b) The KC inequalities are satisfied for the set A x and the sets in fij, for each 1 < i < k — 1. Note 
that if (a) is satisfied, then by Lemma 12.81 |<Sj| < n 4 \ 

(c) X^ee_E c ( e ) x e ls a t most the value of the optimum solution to the linear program (fc-way KC LP). 

Following the proof of Lemma 12.41 it is straightforward to verify that there is a randomized algo- 
rithm that computes a good fractional solution with high probability in time. 

Once we have a good fractional solution, our algorithm is to select A x , the set of nearly integral 
edges, and to select each highly fractional edge e £ E\A X with probability 40/clogn- x e . If F* denotes 
the highly fractional edges that were selected, we return the solution A x U F*. As before, it is trivial 
to see that the expected cost of this solution is O(Adogn) times that of the optimal integral solution. 

We show below that for any i < k — 1, we satisfy all i + 1-way cuts with high probability; taking 
the union bound over the k — 1 choices of i yields the theorem. 

As in Lemmas 12.61 and 12.71 we separately consider the "large" and "small" i + 1-way cuts. First, 
consider any small cut C in Si . From the Chernoff bound (Lemma I2.5j) and the KC inequality for C 
and A x , it follows that the probability we fail to satisfy C is at most l/n 19k . From the cut-counting 
Lemma [222 there are at most n 4 * < n 4fc such small cuts, so we satisfy all the small i + 1 way cuts with 
probability at least 1 — ^tsf • 

For the large i + 1-way cuts C, we separately consider cuts of differing capacities. For each j ' > 2, 
let C(j) denote the i + 1-way cuts C such that jRi < u(C) < (j + l)Ri. Consider any cut C G Cj\ if 
u(A x n 5(C)) > Ri, then the cut C is clearly satisfied. Otherwise, u(8(C) \ A x ) > (j — l)i?j. But since 
we selected each edge e in 5(C) \ A x for F* with probability 40k log n ■ x e , the Chernoff bound implies 
that we do not satisfy C with probability at most nl9k \ j _ 1) ■ The cut-counting Lemma 12.81 implies there 
are most n 2 *«' +1 ) < n 2k ^ +1 ^ such cuts, so we fail to satisfy any cut in C(j) with probability at most 
n 3 . Taking the union bound over all j, the failure probability is at most 2n~ 13 . 
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